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ABSTRACT 


One recurring problem in military operational test and evaluation is deter- 
mination of the number of items to test. This thesis describes a Bayesian 
method to determine the sample size that is needed to estimate a proportion 
or probability with a (1-x)100 confidence when a prior distribution is given to 
that proportion. It uses the two variants of the triangular distribution as priors 
and develops computer programs, graphs, and tables to assist in finding the 
required sample size. These results are compared with other approaches in 
determining the required sample sizes that are needed to obtain a desired 


confidence interval for a proportion or probability. 
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lI. INTRODUCTION 


In the planning of a sample survey, or in many forms of weapon system 
testing, a stage is always reached at which a decision must be made about the 
size of the sample. The decision is important. Too large a sample implies a 
waste of resources, and too small a sample diminishes the utility of the results. 
The decision cannot always be made satisfactorily, for often we do not possess 
enough information to be sure that our choice of sample size is the best one. 
The topic of this thesis is to respond the question of how many observations 
are necessary for a given degree of accuracy, or how large the sample size 
should be to estimate proportions from a set of Bernoulli trials. 

There are several ways to obtain estimates for unknown parameters. In 
this thesis, we will use the definition of a confidence interval to estimate the 
unknown probability or proportion. “A confidence interval for an unknown 
parameter gives an indication of the numerical value of our unknown 
parameter as well as a measure of how confident we are of that numerical 
value. “[Ref. 1: p. 323 ] Generally, the bigger the sample size used, the shorter 
the confidence interval will be. 

Our major focus throughout this work is to determine the number of 
samples that are needed to produce a desired confidence interval size for a 
proportion or probability. This study investigates the necessary sample size 
that would be used with Bayesian statistical methods that make use of the 
existing experience of the experimenter and his knowledge of the phenomenon 
being studied. The uniform density and beta density functions were used as 
the prior distributions in [Ref. 2] and [Ref. 3] where the sample size question 
based on Bayesian confidence intervals was also studied. The uniform 
distribution on the interval (0.1) does not provide a great deal of flexibility in 
choosing a prior. but, it distributes our ignorance equally. The beta 
distribution with various parameter values allows a better control of the 


decision maker’s prior beliefs and the representation of skewing, but it is 


difficult to translate the decision maker’s knowledge and judgement into the 
distribution parameters. In this work we will use the two variants of the 
triangular density function as our prior distributions, because they allow a 
simple representation of distributions which are either more heavily weighted 
in favor of high values of proportions rather than low values or low values of 
proportions rather than high values. After developing the relationship 
between needed sample size and the decision maker’s prior information 
represented by a triangular distribution, we will provide graphs and tables to 
assist a decision maker in finding the number of samples needed to produce 
a desired confidence interval to estimate a proportion or probability. 

We will begin by discussing various methods that can be used to find the 
number of samples needed, and we will compare these methods. In Chapter 
ll we will describe a method to determine the sample size using classical 
Statistics. in the next chapter we will describe our Bayesian method with the 
prior, sampling, and posterior distributions in order to find the sample size to 
estimate a proportion. We will use the two variants of the triangular density 
function as our prior distributions and the binomial distribution as our 
sampling distribution. Also, in Chapter Ill we will give the derivation of the 
posterior distributions. Then, in Chapter IV we will discuss how we developed 
and how we can use computer programs, graphs, and tabies to determine the 
required sample size to obtain a desired 95% confidence interval for a 
probability or proportion. Also, we will explain the computer programs used 
for the Bayesian results. 

In the final chapter we will Summarize our work, and we will give some 


suggestions for further research. 


Il. DETERMINING THE DESIRED SAMPLE SIZE FOR PROPORTIONS USING 
THE CLASSICAL METHOD 


In this chapter, we will describe how we can use classical methods to 
determine the desired sample size to estimate proportions. 

Statistical methods are concerned with using the numbers observed in a 
sample from the population to make inferences about the population or, more 
specifically, a probability measure of the population. We will study estimation 
methods in this work. Problems of estimation are concerned with calculations 
of the numbers that occur in a sample to guess or estimate the values of 
unknown parameters of the population probability law. First we will find a 
point estimate for proportions. Then we will use this point estimate to find a 
confidence interval which provides an Indication of a precision or accuracy of 
an estimate. Next, we will use this confidence interval to determine the 


required sample size. 


A. POINT ESTIMATE FOR PROPORTIONS 

“A point estimate of an unknown parameter is a number, computed from 
observed sample values, that is used as our guess for the value of the 
unknown parameter.” [Ref. 4: p. 175] To estimate this value, we could use 
any number we like. if we can in some way base the number we choose on the 
results observed in selecting a random sample from the population. 

In our case, we could calculate a point estimate. X/n, for the binomial 
parameter p, where X is the number of successes in n independent Bernoulli 


trials, each having p as the probability of success. That Is, 


n 

A canes X 

baa MAH ie) 
j=1 


This is the point estimate for a proportion p. Then we will use this point 


estimate to establish a confidence interval for the proportion. 


B. THE CONFIDENCE INTERVAL FOR PROPORTIONS 

When n is large, we can use the normal approximation to the binomial. 
That is, forn large and np>5, p has approximately a normal distribution with 
mean p and variance p(1-p)/n . Since n is large, we can approximate the 
variance by p(1—/p)/n. The problem of a point estimate is converted into that 
of finding the confidence intervals for the mean of the normal distribution with 
a known variance. Thus a (1-«)100 percent large-sample confidence interval 
for P is given by [Ref. 5: p. 325] 


A p(1—p) A p(t — p) 
(5-2, 2 POP P+ 2, af ) (2.2) 


For example, suppose that the number of defective items in a sample of 
100 is 10. Then from Equation 2.1 the point estimate is p=——— = 0.1.A 


confidence interval of 90% for a proportion is 


Ordex. 0-9 0.1 x 0.9 = 
(0.1 1.645 x . / 100 DOs eealeo4 Ss xa 100 =f) Syl wes,)). 


This says that as a result of our sample of 100 items, we are 90% certain that 
this confidence interval (0.05, 0.15) contains the true value of our proportion. 
But even when this is done, the desired accuracy cannot be guaranteed. “With 
95 percent confidence” or “with 90 percent confidence” means that the 
confidence intervals computed will in 5 or 10 cases out of 100 not include the 
population parameter; in these cases, the desired accuracy will not be 
attained. The only way of guaranteeing the stated accuracy is to measure each 


item in the whole population. 


C. SAMPLE-SIZE DETERMINATION FOR ESTIMATING PROPORTIONS USING 
CONFIDENCE INTERVALS 
Statisticians are often asked the question, “How many observations should 


| take?” Before this question can be answered, we must know what the 


problem is and what kind of risks the decision maker is willing to take. 
Suppose the decision maker is interested in finding a (1-«) level confidence 
interval of a proportion. Suppose further that he wants a proportion to be 
estimated within A units of the true proportion. In that case, the length of the 


confidence interval desired is 2A. Then our true proportion P will be bounded 


by 
ja) ft el ee Fata 


where A is the maximum error of estimate and 


i ee ee (2.3) 


Hence, we can determine the sample size n by solving Equation 2.3, obtaining 


A (1—p) 
| O ammmamaeaay me 


2 
pis Ze. 
1 Ke 


Ls (2.4) 

2 

Equation 2.4 gives the value of n, and a nearest integer would suffice as the 

sample size. Note that the required n depends on the value of p which we 

must assume before we select the sample from which we will estimate it. We 

cannot use the actual value of p in the calculation because it is still unknown. 
The sample size n given by Equation 2.4 is maximized at p =0.5. Thus for 


worst case planning Equation 2.4 yields, for «=Q.05, 


n= (+% )? (05) (0.5) = a 


Here, for example, if we wish the size 2A of the 95% confidence interval to be 
0.10, then we can find n = 385. 


Values of n given by Equation 2.4 to determine different 95% confidence 
interval for different proportions can be found by using Table 1. For example, 
suppose that p = 0.5, and 2A = 0.20 is desired for a 95% confidence interval. 


Then we can find n = 97 by Table 1. 


Table 1. DESIRED SAMPLE SIZE FOR 95% CONFIDENCE INTERVAL 


95% Confidence Experimenter’s Guess about Probability of Success 
Interval Size = 
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_ An alternative to this approach to finding sample size employs a prior 
distribution for P and Bayes’ theorem. In the next chapter we will describe the 
prior distributions, the binomial sampling distribution, and _ posterior 
distributions as they are related by Bayes’ theorem. We will also explain our 
rationale for selecting the two variants of the triangular distribution as our 


prior distributions. 


Hl A BAYESIAN METHOD FOR THE ESTIMATION OF PROPORTIONS, USING 
TRIANGULAR PRIORS 


Statistical inference and decision problems about proportions can be dealt 
with using Bayesian analysis. In the Bayesian approach to statistics, an 
attempt is made to utilize all available information (both sample and prior 
information) in order to reduce the amount of uncertainty present in an 
inferential or decision making problem. As new information is obtained, it is 
combined with any previous information to form the basis for statistical 
procedures. The formal mechanism used to combine the new information with 
the previously available information is Known as Bayes’ theorem. 

In Chapter Il we discussed the use of the classical method to estimate 
proportions. In this chapter we will study the three parts of a Bayesian method 
to determine the required sample size to estimate proportions: the prior 
distribution, the sampling distribution and the posterior distribution. The 
terms “prior” and “posterior” are relative to the observed information. Next 
we will discuss why we selected the triangular density function as our prior 
distribution and the binomial as our sampling distribution. Finally, we will 


derive our posterior distributions by using Bayes’ theorem. 


A. BAYES’ THEOREM 
We will use Bayes’ theorem throughout this work to estimate a proportion. 
Bayes’ theorem is a relatively minor extension of the definition of conditional 
probability. 
The typical phrasing of Bayes’ theorem is in terms of disjoint events 
A,. A;.....A,. whose union has probability one (i.e., one of the A, is certain to 
occur). Prior probabilities P(A,), for the events, are assume known. An event 


B occurs, for which P(B|A) (the conditional probability of B given A, ) is 
Known for each A,. Bayes’ theorem then states that 


p(A,| 8) =< Pee AD PA) 


>», P(B| A) P(A) 


j=1 


for any 1<i<n. These probabilities reflect our revised opinions about the 
A,, in the light of the knowledge that B has occurred. [Ref. 6: p. 129] 
The version which we will use is exactly analogous if we adopt the 
following changes: 
1. Replace P(A) with a probability density function, f(p), 
2. Replace summation, S, with integration, {, and 
3. Let X be the number of successes in n independent Bernoulli trials 


instead of B. 


The version of Bayes’ theorem [Ref. 7: p. 220] that results is 


f(X|P =p) f(p) 


f(p | X) = 
rx |P =p) f(p) dp 


| (aa 


where P is a continuous random variable with density function f(p) so that 


\" (sco = 1. 

We will look at Equation 3.1 in three parts. The density function f(p) 
represents the prior distribution ( before any sampling ), which will be one of 
the variants of the triangular density function in our work. The sampling 
function is f(X |P =p) that is binomial with the sample size n. The probability 
function f(p | X) is called the posterior distribution, obtained by combining the 
prior information f(p) and the sample information (X). At the same time, the 
mean of the posterior distribution is called the Bayesian estimate of P. 

In our continuous case, Bayes’s theorem can be expressed in words as 


. ae (Prior distribution) (Sampling distribution) 
Posterior distribution = ——_i—————o i “$_ $ —_  —_ .. 


(Prior distribution) (Sampling distribution) 


B. THE SELECTION AND USE OF PRIOR DISTRIBUTIONS 

The question is: How should one use the important prior information? 
Bayesian analysis allows effective use of prior information through statement 
of a prior distribution. The prior distribution should, of course, reflect the 
decision maker’s prior information, which may occur in a wide variety of 
states. Larson says: 


The prior distribution of a parameter P can be a probability function or 
probability density function expressing our degree of belief about the value 
of P, prior to observing a sample of a random variable X whose distribution 
function depends on P.[Ref. 1: p. 553] 
Different distribution functions can be characterized as “priors”. In this 
study we will use the two variants of the triangular density function, which are 


developed in the Appendix A, as our priors. They are as follows: 


(Pmax—P), fF 0<PSPmax <1 


2 
f,(p) = J Pmax (3.2) 
lo elsewhere, 


and 


_ 2 (Pmin =P) — for 0 “oe t, 


(Pp) - | (Prin im 1)° (3.3) 
0 elsewhere. 


We also derived the means of these two variants of the triangular density 


function in the Appendix A, which are respectively as follows: 


E,(P) il mak 


and 


E(P) = > (rnin se 


These two variants of the triangular density functions are graphed as 
functions of P in Figures 1 and 2 respectively. We also notice the symmetry 


between the functions shown in Figures 1 and 2. 


f(p) 


Figure 1. Triangular Density Function with Parameter (p,,,) 


f(p) 


Pm a 


Figure 2. Triangular Density Function with Parameter (p,,,) 


Note that when the triangular density functions have parameters p,., =1 
and p,,,=0 in Equation 3.2 and 3.3 respectively, they are special cases of the 


beta density function when «#=1, B=2 and when o=2, f=1. They are as 


follows: 


Cp |a=1,8=2)= 49 iil <del (3.4) 


oF elsewhere, 


and 


2D, OFS pe lr 


0, elsewhere. (3.5) 


f(p | 2=2,p=1=) 


In our study the sample information will be represented in terms of a 
sampling function by the binomial distribution. A random variable X has a 
binomial distribution with parameters n and p if X has a discrete distribution 


for which the probability function is as follows: 


M\ x n-x = for x = 0,1,2.....n, 
roxinpr={la)? —* 


0, elsewhere. 


(3.6) 


In this distribution n must be positive integer, p must lie in the interval 
O<p<1 and the variables x,,....x, form n Bernoulli trials with parameter p. 
[Ref. 826245} 

suppose that P, a random variable, represents the market share of a new 
brand of a certain product. The value of P is a proportion and can take on any 
value between 0 and 1. The new brand Is considerably different from the other 
brands of the product, so we are quite uncertain about the share of the market 
that it will attract. We think that it might attract virtually the entire market for 
the product (that is, P might be close to 1), it might not be successful at all 
(that is, P might be close to Q), or it might be moderately successful. Again 
we assume that P is continuous random variable. We think that low values of 
P are more likely than high values, and we assess a prior distribution for P that 
is a special case of a variant of the triangular density function (see Equation 
3.4). 

In the example of the market share we wish to obtain more information 
about P. A sample of five consumers of the product is taken; one purchases 


the new brand and the other four purchase other brands. We also assume that 


the process of purchasing this product has n independent Bernoulli trials. 
That is, the probability that a randomly selected consumer purchases the new 
brand is equal to P, the market share. The sample information can be 
represented by the binomial distribution that is as foilows : 


9 * 
(X|p) = ( Jou ay 


1 


This sampling distribution is graphed in Figure 3. 
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Figure 3. The Sampling Distribution in the Example of the Market Share 


Also, the prior distribution of this example is from Equation 3.4, and is 


illustrated in Figure 4. 
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Figure 4. The Prior Distribution in the Example of the Market Share 


Applying the version of Bayes’ theorem in Equation 3.1, we have the posterior 


distribution 


, 4 
foie (2 (=P) Opt aa 


| (2(1—p)) (5p (1—p)’) ap 
0 


p(1—py 


a 
| o« —p)’ dp 
0 


The denominator can be stated so that the integral is over a beta density 


function, yielding 


1 
r(2)r (6 r(s 
eN2)S): | ke 6) p*(1—p)°dp = ro 


r'(8) 2)I (6) 


Thus , the posterior density function of P is 


42 p(1—p)’, io p = 7, 
f = 
pales) i elsewhere. 


This posterior distribution is illustrated in Figure 5. 


f(plx) 


Figure 5. The Posterior Distribution in the Example of the Market Share 


15 


This example illustrates how the version of Bayes’ theorem in Equation 3.1 
provides a convenient way to revise density functions in terms of sample 


information, as shown by a comparison of Figures 4 and 9. 


C. GENERAL DERIVATION OF THE POSTERIOR DISTRIBUTION WHEN THE 
PRIOR DISTRIBUTION IS TRIANGULAR 

Using Bayes’ theorem, the prior information (represented by the prior 
distribution) and the sampling distribution are combined to form the posterior 
distribution of P. The posterior distribution summarizes our degree of belief 
of the location of P, given the results of the sample. Of course, the posterior 
distribution depends on the sampling function as well as on the prior 
distribution. 

At this point, we need to remember the definition of the beta distribution 
because we will use it in the remainder of this work. 


It is said that a random variable P has a beta distribution with parameters ~z 
and ff (a>Oand f >0) if P has a continuous distribution for which the p.d_f. 
f(p| «,f) is as follows : 


l (2 as p ) lll f 0 
(pl By = STAY OT cleo na 
0, 


[Ref. 8: p. 294]. 

In later equations, the density function f(p |, B) in Equation 3.7 will be 
shown as b(p: «, f}). Having defined the beta distribution for later work, we 
are ready to seek the posterior distributions that occur with prior triangular 
distributions and the binomial sampling distribution. 

1. Posterior Distribution with Prior Triangular Distributions Having 
Parameter Pmax 

First, we will derive the posterior distribution by using a prior 
triangular distribution with parameter p,,, which is going to be more heavily 
weighted in favor of low values of P rather than high values. Applying the 


version of Bayes’ theorem in Equation 3.1 to combine the prior triangular 


distribution in Equation 3.2 and the binomial sampling distribution in Equation 


3.6 , we have 





" -x_ 2 
( jer a p)" ; 9 (Prax =) 
S Pmax 


Co) a 
n\ n-x 2 
| ("p (i=) a (Pmax = 0) 10/8 


Pmax 


Pere OU — Pp < p.., = |. 


lf we cancel out some terms, we have 


p'(1 a (2) ea ae ==) 


Pmax 
GS hn, = nels 
“0 


f,(p | x)= 


lf we multiply terms in the denominator, our posterior becomes 


“(1 - saa max 
7 p Day Ve p) 


cee Pmax 

a 1 = 
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We multiply the denominator by the same terms to create the beta density 
functions. b(x +1,n—x +1), under the first integral, and b(x+2,n—x +1) under 


the second integral. Then the denominator is 
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By using the property of the gamma function that 
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the denominator becomes 
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lf we combine terms, we have the posterior distribution f,(p | x) as 
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When we multiply the terms in the numerator, multiply the second term by 
( x+1 n+2 

ee pe eee 
n+e X+ 1 
Pr (n+3)=(n+2) T (n+2) we will obtain two forms of the beta density function 


, and then use the property of the gamma function that 
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The posterior distribution, f,(p | x), becomes 
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We notice that the denominator has two forms of the beta cumulative 
distribution function with the same parameters «,, «, £, and ff, , which is 


shown as B(p,.,: «. £). Therefore we have 
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So finally, our posterior distribution becomes 
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where 
¢ pis the probability of success (O<p<p,., <1). 


e x is the number of successes that occurred in n independent Bernoulli 
trials, 
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Using a prior triangular distribution with parameter p,,., and a binomial 
sampling distribution, we have the posterior distribution in Equation 3.8. This 
posterior distribution shows that the size of the confidence interval for the 
proportion P depends upon the parameter of the prior (p,.,), the sample size 
n, and the number of successes x . 

We remember from the classical method that we need to know the 
number x of successes (before sampling) to determine the number n of 


samples. So, prior to sampling, we will make an assumption about x that it is 


equal to its expected value which is the mean of the prior triangular 


distribution with parameter p,.., multiplied by the number of samples, or 


= Pmax 
xXx = (ete n). 


Applying the result of this assumption into Equation 3.8, we have the 


posterior distribution as 
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one of the our posterior distributions becomes 


f(p x)= : Wore ol (eae B,) ae IN, D(p : dp. Bo) (3.9) 


Pmax 8(Pmax : 1 Bs)| — [Ni Bmax 3 2: Bo) | 


where o,.0,.), and ~, = 0) 

The final form of one of our posterior distributions continues, of course, 
to have two forms of the beta density function with parameters <j, «3, Bj, B3 in 
the numerator and two forms of the cumulative distribution function of the beta 
distribution with same parameters aj, «3, fj, 2; in the denominator. This 


means that existing computer programs for both the beta density function and 


the beta c.d.f. can be employed in our computationial work to relate desired 
confidence interval size to the sample size. 
2. Posterior Distribution with Prior Triangular Distributions with 

Parameter Pmin 

Now we will derive the posterior distribution by using a prior triangular 
distribution with parameter p,,,. Here, f(p) is going to be more heavily 
weighted in favor of high values of P rather than low values. Applying Bayes’ 
theorem from Equation 3.1 to combine the prior triangular distribution in 


Equation 3.3 and the binomial sampling distribution in Equation 3.6, 


i 
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where 0<p,,,.<p< 1. After we apply the same steps shown above in the 


derivation of the f,(p | x) , the posterior distribution, f,(p | x) , becomes 
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Here 
¢ pis the probability of success (0 <p,,, < p< 1), 


e x is the number of successes that occured in n independent Bernoulli 
trials, 
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Using a prior triangular distribution with parameter p,,, and a binomial 
sampling distribution, we have the posterior distribution in Equation 3.10. This 
posterior distribution shows that the size of the confidence interval depends 
upon the parameter of the prior (p,,,), the sample size n, and the number of 
SUCCESSES X. 

At this point, we also need to know the number x of successes (before 
sampling) to determine the number n of samples. So we will make an 
assumption about x that it is equal to the mean of the prior triangular 


distribution with parameter p,,, multiplied by the number of samples, or 
n 
eo (2 min +2). 
Applying the result of this assumption to the Equation 3.10, we now 


have the posterior distribution f,(p | x) as 
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then our second posterior distribution becomes 


[Prin B(P : 3, B3)| — [No b(p : a4. Ba) 


f,(p | x) = ee ee ee. aes) .|!lCU 
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, (3.11) 


where «3, «3, 23, and Bf, > 0. 

The final form of our second posterior distribution has also two forms 
of the beta density function with the parameters «3, «3, 3, By in the numerator 
and two forms of the beta cumulative distribution function with the same 
parameters «3, «3. 83, 2, in the denominator. 

We will use Equation 3.9 and Equation 3.11 as our posterior 
distributions in computer programs which were written in APL. 

In the next chapter we will present computer programs, tables, and 
graphs that can be used by a decision maker to determine the required sample 
size to obtain a desired size for a 95% Bayesian confidence interval for a 
probability or proportion. Also, we will show that in the Bayesian method, the 
desired sample size may be smaller than the sample sizes obtained by using 


the classical methods of Chapter Il. 
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IV. DETERMINING THE DESIRED SAMPLE SIZE TO ESTIMATE 
PROPORTIONS USING THE BAYESIAN METHOD 


In this chapter we will discuss the Bayesian method by making use of 
subjective probabilities measuring degrees of belief in order to determine the 
number of samples for proportions. As we mentioned in Chapter Ill, these 
probabilities are called the prior distribution. Thus when using the Bayesian 
method, this prior distribution summarizes the decision maker’s subjective 
degree of belief about the unknown values of proportions. When the decision 
maker has this subjective prior information (degree of belief) about bounds for 
the unknown proportion which can be described by prior triangular 
distributions having parameters p,., or p,i,, the sample size can be 
determined by using these triangular distributions, the desired confidence 
level. and the confidence interval size. 

First, we will discuss some of the considerations the decision maker might 
make which would lead to the selection of a prior triangular distribution. In the 
next section, after developing the relationship between the desired sample 
size and the decision maker’s prior information represented by one of the two 
variants of the triangular distribution, we will explain the tables which are 
related to the Bayesian interval sizes and the number of samples. Next, we 
will explain the graphs to determine the number of samples for proportions. 
Finally, we will discuss the Bayesian method using the prior triangular 
distribution in which the sample sizes may be smaller than the sample sizes 
obtained by using the classical method in Table 1 on Page 8. Throughout this 
chapter we will also explain how the computer programs work in determining 


the sample size to estimate proportions. 


A. DETERMINING THE PRIOR TRIANGULAR DISTRIBUTIONS AND THEIR 
PARAMETERS 
Under what conditions might one of the two forms of the triangular density 


function be reasonable as priors? We can use the forms of the triangular 


distribution as priors to represent skewing without having extensive additional 
information about the prior distribution. When the decision maker feels 
skewing is present, a triangular prior is an improvement over the prior uniform 
distribution, and the prior triangular distribution is also less complicated than 
a prior beta distribution where two parameters, which may be subjective, must 
be stated to fit the decision maker’s prior information. For a prior triangular 
distribution, for example, the only information needed from the decision maker 
is that low values of P are more likely than high values (e.g., P = proportion 
nonconforming) and a statement of p,,,,, which could be 1.0. Alternately, the 
decision maker may feel that high values of P are more likely, in which case 
he or she needs only a value for p,,,,, which could be 0. 

First, we consider a random variable P with the triangular density function 


of the form in Equation 3.2 which is as follows: 


Pmax 


2 
a fren (Dray —P), for 0 <P Pmax <1; 
1 — 
0, eisewnere. 


This prior triangular distribution is going to be more heavily weighted in favor 
of low values of P rather than high values. One way the decision maker can 
decide the parameter of this prior triangular distribution is by using the mean 


a and must be selected 


of this prior triangular distribution, which is E(P)= 
between 0 and 0.33333. For example, if the decison maker guesses the mean 


value as 0.2, the parameter can be found as follows: 
Pmaxy = S3E(P) = 0.6. 


At the same time, the value of parameter p,,., reflects the amount of positive 
skewing. 
Next, we consider the other form of the triangular density function in 


Equation 3.3 which is as follows: 


a> 


2) 3 
(Prin ~ 1) 


_ 2 (Pmin — P) for 0 <pri,<p <1, 
nie) =} 

0, elsewhere. 
This form of the triangular density function with parameter p,,,, is going to be 
more heavily weighted in favor of high values of proportions rather than low 
values. Again, one way by which the decision maker can decide the parameter 
of this prior triangular distribution is to use the mean of the prior triangular 
distribution, which is E(P)= > (Pmin +2) and must be selected between 0.66666 
and 1. For example, if the decison maker guesses the mean value E(P) as 0.8, 


the parameter can be found as follows: 
Dain = Joe eee — era 


At the same time, the value of parameter p,,,, reflects the amount of negative 
skewing. 

These examples show how triangular distributions allow the decision 
maker to express prior beliefs about the values of P when only limited 
information is available. We will next explain how we find the Bayesian 


bounds and interval sizes. 


B. FINDING THE BAYESIAN INTERVAL SIZES AND BOUNDS 
We gave the derivation of the posterior distributions in Chapter Ill when 
the priors are in the form of the triangular density function. Results from these 


posterior distributions are as follows: 
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where «3, «}, {3, and ~; >0. In these expressions we assumed that the number 
of successes is equal to the mean of the prior distribution times n. The 
posterior distributions in the above forms are here for the purpose of finding 
sample size. As we mentioned in Chapter Ill, they are in the form of linear 
combinations of two beta density functions in the numerator, and the linear 
combination of two beta cumulative distribution functions in the denominator. 
The parameters are functions of the sample size n, and p,., OF Pm The 
denominators are not functions of the random variable P. 

If we specify a value for the sample size n and a value for p,,,, Or Px, We 
can compute (for, say, a 95 percent confidence level) the cumulative 
distribution functions at 0.025 and 0.975 for the posterior density functions. 
We notice that the term in the denominator is constant for given p,., and n. 


For example, the cumulative distribution functions of the posterior 


distribution f,(p x=) at 0.025 and 0.975 are 
p.lo p.lo 
Pmax| b(p)dp — n,| b(p)dp 
ee ee eee _ 0,005. (4.1) 


Pmax Pmax 
Pmax b(p; a4, B4)dp — mf b(p; a2, B2)ap 
0 
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and 


p.up — p.up 
Pman| b(p)dp — n,{ b(p)dp 
F.(p.up) = —————___—____._______— = 0.975. (4.2) 


Pmax Prmax 
Pes| b(p; «4, B4)dp — mf b(p; ao, Ba)dp 
0 0 


Using the above equations we can obtain the lower and upper bounds of the 
95 percent Bayesian confidence interval that would result had the number of 
successes in the sample been equal to the mean number from the prior 
distribution. Finally, we will find the interval sizes by subtracting the upper 


bound from the lower bound. 
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For the above procedure, we developed the APL programs named 
PMINIMUM and PMAXIMUM, which are given in Appendix B and C, 
respectively. The program, PMINIMUNM, is for the posterior distribution with 
the prior triangular density function having parameter p,,,,, and PMAXIMUM is 
for the other posterior distribution with the prior triangular distribution having 
parameter p,,,. Both programs are the main programs used in our analysis. 
Each program is interactive and the user is required to enter the bound of the 
prior triangular distribution (p,., Or P,i, ) and the sample sizes. 

For example, the APL program, PMAXIMUM, computes the upper bounds, 
the lower bounds, and the Bayesian interval sizes using Equations 4.1 and 4.2. 
Also it uses the APL program BETA which was designed at the Naval 
Postgraduate School to compute the beta density function. This program is 
given in Appendix D. It should be noticed that if the total parameter value of 
any of the beta distributions in the posterior distributions exceeds 255 ( 1.e., 
w+ f,;> 255 or 4, +f; = 255 ), BETA cannot compute the beta density 
function. 

In the next section, we will explain how the decion maker can use these 


tables. 


C. DETERMINING THE SAMPLE SIZES WITH TABLES 

Decision makers can use tables to facilitate their determination of the 
sample size using the prior triangular distribution and a desired confidence 
fever 

Let us explain with examples how the decision maker can use these tables. 
Suppose that the decision maker’s prior triangular distribution parameters are 
Pmax= 1 Or p,;, =0. Also suppose that the decision maker desires the Bayesian 
interval size (2A) to be 0.20 for estimating the proportion, with a 95 percent 
confidence level. The APL programs create tables similar to Tables 2 and 3 
using the parameters p,,=1 or p,,,=0, respectively. Then the decision 
maker can find the sample size, 81, from Table 2 or 3. This is the desired 
sample size n that reflects both the decision maker’s subjective bounds and a 


95 percent confidence level. 
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As we mentioned in Chapter III, the prior triangular distributions having 
parameter p,., = 1 and p,,,=0 are symmetric which also holds for p,,,=0.8 
and p,,, =0.2 or other situations. Therefore, we notice that their Bayesian 
interval sizes are the same (see Table 2 and 3), but also we should notice that 
they have different lower and upper bounds with the same Bayesian interval 


size. 


Table 2. SAMPLE SIZES AND BAYESIAN INTERVALS USING THE TRIANGULAR 
PRIOR DISTRIBUTION WITH PARAMETER PMAX= 1.0 


Sample Size Genomeanne Bayesian | Interval Size 
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Table 3. SAMPLE SIZES AND BAYESIAN INTERVALS USING THE TRIANGULAR 
PRIOR DISTRIBUTION WITH PARAMETER PMIN=0.0 


Sonne Size Upper Bound Saves MNS size 


Pott tio 97468 
P21 | 96038 | 7087 
Ps 283s | 94730 
P4095 | 0.9353 TGA 
— 5 03310 | 9240935 
6 | osaon | tas 05S 
83783 | 980 | IT 
9 | os903 | 90705005 


Oy, HR, O};M] — 


0.5553 0.7694 0.2140 
en 0.5624 0.7634 0.2010 


| go | es2 | ses | 1901 


0.5813 0.7469 0.1656 
0.5846 0.7440 0.1593 


0.5876 0.7413 0.1537 
0.5903 0.7389 0.1487 
0.5927 0.7368 0.1441 
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It is true that 
e Upper Bound (in Table 2) = 1 - Lower Bound (in Table 3) or 
e Lower Bound (in Table 2) = 1 - Upper Bound (in Table 3). 


Therefore after obtaining the tables with the triangular density function using 
Pmin It is easy to obtain other tables with the triangular density function using 
Pmax: 

Tables such as Table 2 and Table 3 with various values of p,., and p,,;, that 
can be used to determine the sample size needed to produce a desired 
confidence level to estimate a proportion are located in Appendices E and F. 
In the next section, we will explain how the decision maker can use graphs to 


determine the sample size to estimate proportions. 


D. DETERMINING THE SAMPLE SIZES WITH GRAPHS 

In this section, we will provide graphs to assist in determining the number 
of samples by using triangular distributions with various parameters as priors. 

Programs PMINIMUM and PMAXIMUM create vectors in the APL 
workspace of the lower bounds, the upper bounds, and Bayesian interval 
sizes. If we plot the vector of the sample sizes versus the vector of the lower 
bounds and the upper bounds, we can obtain the graph illustrated in Figure 
6. From Figure 6, it can be seen that when the sample sizes increase, the 
Bayesian interval sizes decrease. 

lf we plot the vector of the sample sizes versus the size of the 95 percent 
Bayesian interval we obtain the graph illustrated in Figure 7. Let us explain 
with an example how the decision maker can use this graph. Suppose that the 
decision maker’s prior triangular distribution parameter is p,,,=0. In addition, 
suppose that the decision maker desires the Bayesian interval size to be 0.20 
with a 95 percent confidence. First, the decision maker, using Figure 7, must 
find 0.20 on the ordinate and then move across the graph to where 0.20 
intercepts the curve. The decision maker can then read the sample size, 
approximately 81, on the abscissa. 

The decision maker can find graphs in Appendix G with various 


parameters of the prior triangular distribution to determine the sample sizes 
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Figure 6. Number of Samples vs the Bounds of the 95% Bayesian Interval with 


a Triangular Prior Distribution Having Parameter Pmin=0 


needed to obtain a desired 95 percent confidence level for proportions. In the 
next section we will study the sensitivity of sample size to the bounds in the 


prior distribution. 


E. SENSITIVITY OF SAMPLE SIZE TO THE PARAMETERS IN THE PRIOR 
DISTRIBUTION 

At this point the decision maker may be interested in this question: how 
will variations in the prior distribution affect the ultimate decision? Or, in other 
words, how sensitive is the sample size n to the (possibly) guessed value of 
the bound (p,.., or p,j,) in the prior distribution? To answer these questions 
we will change p,,,, or p,,, with the sample size n held constant, and we will 
look at the values of the Bayesian interval size 2A. For example, if we want the 


Bayesian interval size 2A to be 0.2 and guess p,,, =0.2, we obtainn=/70. Now 
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Figure 7. Number of Samples vs the Size of the 95% Bayesian Interval with a 


Triangular Prior Distribution Having Parameter Pmin=0 


if we change our guess, e.g., p,,,=0.1, and use n=/70, we obtain 2A=0.2079. 
Erroring in the other direction, for p,,, =0.3 and n=/70, we obtain 2A=0.195. 
Thus for this example, the sample size n and the choice of bounds p.., or pf, 
appear to be relatively insensitive. When the sample sizes are bigger, the 
sample size n is reasonably insensitive to the choice of guessed values of p,,,, 
Sige (see Figure 8). 


Next, we will make a comparison between methods. 


F. COMPARISON OF THE CLASSICAL METHOD AND THE BAYESIAN 
METHOD USING THE TRIANGULAR PRIOR DISTRIBUTION 
Let us compare the results obtained by using our approach versus the 


classical method. The classical method for determining the desired sample 
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Figure 8. The Sensitivity of C.l. Size to the Guessed Value of Pmin 


sizes to estimate proportions requires the decision maker’s guess about the 
probability of success and the length of the confidence interval (2A). The 
Bayesian method, our alternative approach, needs 2A and the bounds or 
parameters of the prior density function and direction of skewing. Also, in this 
discussion we note that 
For any finite sample size, the Bayesian estimate is ‘shaded’ toward the 
prior mean, the best guess for P before any sample values were taken. This 
effect disappears as n increases indefinitely.[Ref. 1: p. 566] 

Suppose that the bound of the prior triangular distribution is p,,,=0 and 
the desired interval size (2A) is 0.20. Then we find E(P)= 0.6666 as the mean 
of this prior. Suppose that this mean value from the Bayesian method is equal 
to the decision maker’s guess about the probability of success in the classical 
method. By using this value in Table 1 on page 8, the classical method 


requires 86 as the sample size. At the same time, if we use the tables based 
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on the Bayesian method we find the sample size to be 81 from Table 3. Other 
examples are shown in Table 4. 

50, if we compare the results of these two methods which meet the above 
requirements, we realize that the results of the Bayesian method are quite 
favorable to those obtained using the classical method. When the values of 
the sample sizes are smaller, the values of the sample sizes based on 
Bayesian method are quite different. This is apparent from the results in Table 
4. Larson says, “The difference between the Bayesian values, and the 
classical approach, disappears as n increases” [Ref. 1: p. 573]. Also, we 
realize that when the sample sizes become larger, the posterior distribution 
becomes less dependent on the subjective prior information and more 


dependent on the objective sample information. 


Table 4. COMPARISON OF THE CLASSICAL AND THE BAYESIAN METHODS 


Size of Sample Size sample Size 
pets from Bayesian from Classical 
Method Method 





In the next chapter, we will summarize our study, and we will give some 


suggestions for further research and study. 
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V. SUMMARY AND SUGGESTIONS FOR FURTHER RESEARCH AND STUDY 


Determination of the number of items to be tested remains an important 
problem in military operational test and evaluation, particularly when a 
proportion, such as an item’s reliability, is to be estimated. One well known 
approach is to state a requirement for the size of the confidence interval that 
will estimate the proportion, and then use that requirement to determine the 
sample size. 

Application of Bayesian statistics (which employ prior information about 
the proportion to be estimated) can reduce the number of observations 
needed. Floropoulos [Ref. 3] studied the case where, prior to sampling, the 
decision maker might be able to bound the proportion, but was uncertain 
about it otherwise. Manion [Ref. 2] examined the sample size determination 
question when enough prior information was available to specify a beta 
distribution as a prior. The study in this thesis looked at the case where there 
was more information present than the uncertainty represented by a uniform 
distribution, but not enough detail to use a beta prior. 

In this chapter, we will summarize how we used the Bayesian method with 
the triangular priors to obtain the sample sizes to estimate proportions. 


Finally, we will give some suggestions for additional studies. 


A. SUMMARY 

Throughout this study, we described the Bayesian method to determine 
the desired sample size that is needed to estimate proportions with a (1-«) 100 
confidence when a prior distribution is given to a proportion. 

First, we described a classical method to determine the sample size for 
estimating proportions using confidence intervals. Then, we described an 
alternative to this approach which was the Bayesian method. We studied the 
three parts of this Bayesian method: the prior distribution, the sampling 
distribution, and the posterior distribution. When using the Bayesian method, 


the prior distribution expresses the decision maker’s degree of belief of the 
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location of proportion P prior to sampling, and the posterior distribution 
expresses the decision maker’s degree of belief of the location of proportion 
P given the results of the sample. We developed and used the two forms of the 
triangular density function as our priors. Using Bayes’ theorem, we combined 
these two prior triangular distributions and the binomial sampling distribution 
to form the posterior distributions. The forms of these posterior distributions 
had two forms of the beta density function in the numerator and two forms of 
the beta cumulative distribution function in the denominator. Then using these 
two posterior distributions, we developed computer programs, tables, and 
graphs that can be used by a decision maker to determine the desired sample 
size to obtain a 95 percent confidence level to estimate proportions using 
Bayesian intervals. We also explained how the decision maker might select 
the prior triangular distributions and their bounds, and how decision makers 
can use tables and graphs to facilitate their determination of the sample size 
in some decision making applications. 

Finally, we showed that results from the Bayesian method are quite 
favorable to those obtained using the classical method. When the values of 
the sample size are small, the values of the sample sizes based on the 
Bayesian method are quite an improvement. We also showed that when the 
values of the sample size are large, the sample size n is reasonably insensitive 
ieine Choice of bounds, P,.., OF P,,,: 


In the next section we will suggest some additional studies. 


B. SUGGESTIONS FOR FURTHER RESEARCH AND STUDY 

In previous studies, the prior beta distribution allowed better control of the 
representation of the decision maker’s prior beliefs, while the prior uniform 
distribution did not provide a great deal of flexibility. im our study, prior 
triangular distributions did not provide exceptional flexibility in selecting 
priors but we realized that the use of prior triangular distributions is less 
complicated than using the prior beta distribution. In other words, these two 
prior triangular distributions can be used when estimations about proportions 


are made about the minimum or maximum values of the random variable P 


oF 


and the skewing of the prior. At this point we suggest that when estimations 
are made about the modal values of the random variable, another prior 
density function could have a different triangular shape with the following 


‘5/10 /pe 


2(x — a) ee 
(b—a)(C —a) © SS ied 
f(x) _— 2(c = x) . 
(¢ — Be =) eae 
O, elsewhere, 


where a<b<c. Here, note that both upper and lower bounds can be 
changed at the same time. 

Also, in addition to 95 percent confidence, all tables and graphs could be 
developed using other confidence levels, such as 90%, 97.5%, and 99%. An 
additional study which could be made would determine the number of samples 
for estimating proportions if nonparametric methods are to be used. 

It is hoped that the work presented here will be useful to experimenters, 
decision makers, and test planners in deciding how big a sample. or how many 


trials must be done, in order to estimate a proportion or a probability. 
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APPENDIX A. DERIVATION OF THE TWO VARIANTS OF THE TRIANGULAR 
DENSITY FUNCTION 


We will use basic calculus to derive the two variants of the triangular 
density function. Both density functions are linear functions, one with negative 
slope for random variable P defined O<p<p,.<1, and the other with 
positive slope forO<p,,,< p< 1. First, we will give the definition of slope that 
is as follows: 

lf P, = (x,. y,) and P, = (x,, y.) are points on a nonvertical line 7, the slope 
of “ is defined by the ratio [Ref. 9: p. 19] 

i 2. (A.1) 
Then we will define the equation of 7. Suppose /¢ is a line with slope m which 
contains the point (x,. y,). To find the equation for “ we let p=(x,y) be an 


arbitrary point on @. Then, by Equation A.1. we obtain 


ee eee 
Ms XG 
SO 
Vane (OX axa) (A.2) 


Now we can find the equation of the first variant of the triangular 
distribution with parameter p,,., for the line through (0,a) and ( p,,,, ,0) (see 


Figure 1). First, we will find the slope by using Equation A.1, that is 





where 0<p<p,, <1 and 2<a<oo. Using the point (0,a) and the slope 
a 
Pmax 





n= in Equation A.2 then gives 


ay 





a Se (PST) 


Pmax 


Setting y=f(p). we then have 
a | 
f(p) = 1. (Pmax — P). (A.3) 
max 


We will find an equation for “a” by using the following property of density 


functions, 


Pmax a 
| an. (Omar aoe — 
; max 


which tmplies that 


2 


Pmax 





a = 


Substituting “a” into Equation A.3, we have the triangular density function as 


f(p) = = (pmax —P). (A.4) 


Pmax 


Where O02 p ea aale 

Also, we need to remember the definition of the expectation for a 
continuous distribution to find the means of the two variants of the triangular 
density function. If a random variable P has a continuous distribution for 
which the p.d.f. is fio) then the expectation E(P) is defined [Ref 8: p. 180] as 


follows : 


E(P) = \" p flp) dp. (A.5) 


—OoO 


The mean of a random variable with the density function of Equation A.4 is 
then 
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| Pmax a p | 
A) | P —Z— (Pmax—P) dp = 3 (A.6) 
0 


Pmax 


Now we will find the equation of the second variant of the triangular 
distribution with parameter p.,, for the line through (1,a) and ( p,,,, ,0) (see 


Figure 2). First, we will find the slope by using Equation A.1, that is 


Where 0<p,,,5p<1and 2<a<oo. Using the point (1,a) and the slope 


= = — in Equation A.2 then gives 


(pe. =i) 


a 


ae) ee Pee |): 
(Pin =) 
Setting y =f(p). we then have 
a ‘ 
ee (Pmin = P)- (A.7) 
Pmin 


Again we will find another equation for “a” by using the following property 


1 
a 
| | (Dean —p)dp ais UF 
Pmin Prin 


which results in 


Z 


a SS 
eae On 


Substituting “a” into Equation A.7, we have the density function 


a (Pmin 7 p) 


| (A.8) 
Oba - iL 


GG 


4] 


where 0<p,,,<p<1. The mean is 


—2(Pmin — 
cP) = | p — ome dp = Lain + 2) (A.9) 


2 
Pmin (Pimin i) 
The above two variants of the triangular distribution are used as our priors 
to derive the posterior distribution based on Bayes’ theorem in Chapter III of 


this thesis. 


APPENDIX B. THE APL PROGRAM USED TO COMPUTE BAYESIAN 
INTERVALS WITH THE TRIANGULAR PRIOR DISTRIBUTION HAVING 
PARAMETER PMAX 


V PMAXIMUM 
THIS PROGRAM COMPUTES UPPER BOUNDS, LOWER BOUNDS AND BAYESIAN 
THE NUMBER OF SAMPLES TO ESTIMATE 
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APPENDIX C. THE APL PROGRAM USED TO COMPUTE BAYESIAN 
INTERVALS WITH THE TRIANGULAR PRIOR DISTRIBUTION HAVING 
PARAMETER PMIN 





V PMINIMUM 

fa] a THIS PROGRAM COMPUTES UPPER BOUNDS, LOWER BOUNDS AND BAYESIAN 
[21 a CONFIDENCE INTERVALS TO DETERMINE THE NUMBER OF SAMPLES USING 
(31 a PRIOR TRIANGULAR DISTRIBUTION (OSPMINSP<1). IT ASK THE USER 
ful a TO INPUT THE VECTOR OF SAMPLE SIZE, THE PARAMETER OF PRIOR 
[5] a TRIANGULAR DISTRIBUTION AND CONFIDENCE LEVEL. IT USES PROGRAM 
i" a BETA AS SUBROUTINE. 
8] Qe ENTER SAMPLE SIZE! 

< 
[10] O+'ENTER BOUND OF PRIOR TRIANGULAR DISTRIBUTION' 
[11] PMi+«Q 
[i129 SsAarco 
[13] LOOP1:SAY+«SAY+1 
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APPENDIX D. THE APL PROGRAM USED TO COMPUTE THE BETA DENSITY 
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APPENDIX E. TABLES THAT CAN BE USED TO DETERMINE SAMPLE SIZES 
BY USING THE PRIOR TRIANGULAR DISTRIBUTION WITH VARIOUS PMIN 
PARAMETERS 


Table 5. SAMPLE SIZES AND BAYESIAN INTERVALS USING THE TRIANGULAR 
PRIOR DISTRIBUTION WITH PARAMETER PMIN=0.0 


Sample Size Upper Bound | Bayesian Interval Size 


pa T3095 | 09353 | 06258 
2: | 
Pp 8 | 7es | 980 IT 
_ 9 | 3903 | 8907 | 05005 
- 60) | sae | rev | 0.2300 
| 80624 | 634 | 2010 
/ 905682 | 583 | 901 
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Table 6. SAMPLE SIZES AND BAYESIAN INTERVALS USING THE TRIANGULAR 
PRIOR DISTRIBUTION WITH PARAMETER PMIN=0.1 


3 
4 


| SSS Se en eee a 
—_ Ee 
SE a a a ey 
2 a ee a ee ae 
_ ot Se ee ee 
0.6316 
0.6334 


Table 7. SAMPLE SIZES AND BAYESIAN INTERVALS USING THE TRIANGULAR 
PRIOR DISTRIBUTION WITH PARAMETER PMIN=0.2 


Sane lg Size Upper Bound a SIZE 


pot sasg 09815 65H 
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em 
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60 | 175 | 08325 | 2150 
“80 | e330 | 210 | 880 
p90 385 |S T0781 
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Table 8. SAMPLE SIZES AND BAYESIAN INTERVALS USING THE TRIANGULAR 
PRIOR DISTRIBUTION WITH PARAMETER PMIN=0.3 


| Eee een eer 
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Table 9. SAMPLE SIZES AND BAYESIAN INTERVALS USING THE TRIANGULAR 
PRIOR DISTRIBUTION WITH PARAMETER PMIN=0.4 


Sample Size Upoemooune Bayesian ata Size 


7 
| 2015099 oe eee 
a eee ee ee 
_ 4 | 05259 | ee a ee 
en ee ana 
ee ee er a a 


O17, RCM] — 


| —m ./5 0.5565 0.9565 0.4000 
oo 0 0.5634 0.9530 0.3896 


| 60 |S 919 esse aise 
| 80 | 07065 e755 ese 
___90 | 0c) 2 er 
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Table 10. SAMPLE SIZES AND BAYESIAN INTERVALS USING THE 
TRIANGULAR PRIOR DISTRIBUTION WITH PARAMETER PMIN=0.5 


__ 5S ee ee 


___ 3 J Cee nee eee 
EE 
nee, Oreo a OS on 
be eee ae ee a 
__ 3 EE eee a ee ara 


0.7887 0.8723 0.0836 


Table 11. SAMPLE SIZES AND BAYESIAN INTERVALS USING’ THE 
TRIANGULAR PRIOR DISTRIBUTION WITH PARAMETER PMIN=0.6 


|G 06790 2 ee 
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| ee ee ee eee 
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0.8286 0.8992 0.0707 
0.8311 0.8973 0.0662 
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Table 12. SAMPLE SIZES AND BAYESIAN INTERVALS USING’ THE 
TRIANGULAR PRIOR DISTRIBUTION WITH PARAMETER PMIN=0.7 


Sample Size WepeiBewAd Bayesian Interval Size 


Nn 
te 0./565 0.98/78 0.2313 
Deki 9 0.9867 0.2288 


0./593 ORMESS 0.2262 


0.7994 0.9633 0.1639 


0.8717 0.9240 0.0523 
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Table 13. SAMPLE SIZES AND BAYESIAN INTERVALS USING. THE 
TRIANGULAR PRIOR DISTRIBUTION WITH PARAMETER PMIN=0.8 
Sample Size Bayesian Interval Size 
0.9970 
0.9965 
0.9960 0.1630 
0.9954 
0.8340 0.9949 0.1609 
rn ae ae 0.9943 0.1598 
0.9938 
ow 0.8356 0.9932 0.1576 
|. G a ase a 0.9927 0.1566 
0.9922 0.1555 
i eee ek ee aa ee aaa ae 
Mn Dn a Sea ees 
0.8683 0.9738 0.1055 
| 80 seo ae 0.9723 0.1004 
| OO OS ae 0.9709 
0.9696 0.0917 
0.8804 0.9685 
0.9665 0.0819 
0.9657 0.0803 
0.8871 0.9649 0.0778 
0.9641 0.0754 
0.8901 0.0733 
0.9628 0.0713 
0.8926 0.9622 0.0695 
0.9616 
0.9606 
0.8975 0.9596 
0.9592 0.0609 
0.8990 0.0598 
0.9584 0.0587 
0.9004 0.9580 
0.9010 0.0567 
0.9573 0.0558 
0.9559 
| Sd 400 —S—“‘L C9062” esse eee ee eee 
0.9536 
0.9527 0.0434 
0.9105 0.9519 
= 600) 2 emit) 94 ee 0.9512 0.0397 
0.9124 0.9506 0.0381 
0.9132 0.9500 0.0368 
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APPENDIX F. TABLES THAT CAN BE USED TO DETERMINE SAMPLE SIZES 
BY USING THE PRIOR TRIANGULAR DISTRIBUTION WITH VARIOUS PMAX 
PARAMETERS 


Table 14. SAMPLE SIZES AND BAYESIAN INTERVALS USING THE 
TRIANGULAR PRIOR DISTRIBUTION WITH PARAMETER PMAX= 1.0 
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Table 15. SAMPLE SIZES AND BAYESIAN INTERVALS USING’ THE 
TRIANGULAR PRIOR DISTRIBUTION WITH PARAMETER PMAX= 0.9 
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Table 16. SAMPLE SIZES AND BAYESIAN INTERVALS USING’ THE 
TRIANGULAR PRIOR DISTRIBUTION WITH PARAMETER PMAX= 0.8 


Yk 
1 
po forse | 333 |S 


p60 | te75 | 3825 | 2150 
p80 | 90. | e700 
p90 tesa 08615 BT 


oY 


Table 17. SAMPLE SIZES AND BAYESIAN INTERVALS USING’ THE 
TRIANGULAR PRIOR DISTRIBUTION WITH PARAMETER PMAX=0.7 
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Table 18. SAMPLE SIZES AND BAYESIAN INTERVALS USING’ THE 
TRIANGULAR PRIOR DISTRIBUTION WITH PARAMETER PMAX= 0.6 


Sample Size Waoae Saun Bayesian val Size 


0.0122 0.4978 0.4856 
0.0172 0.4901 0.4729 
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Table 19. SAMPLE SIZES AND BAYESIAN INTERVALS USING THE 
TRIANGULAR PRIOR DISTRIBUTION WITH PARAMETER PMAX=0.5 


Sample Size oper eeu Bayesian Interval Size 
Nn — 
0.0095 0.4167 0.4072 
0.0129 0.4121 0.3992 
0.0164 0.4073 0.3909 
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Table 20. SAMPLE SIZES AND BAYESIAN INTERVALS USING THE 
TRIANGULAR PRIOR DISTRIBUTION WITH PARAMETER PMAX=0.4 


Sample Size Sonemeouns Bayesian Interval Size 


0.3344 
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Table 21. SAMPLE SIZES AND BAYESIAN INTERVALS USING’ THE 
TRIANGULAR PRIOR DISTRIBUTION WITH PARAMETER PMAX=0.3 


Sample Size Terree Benteg Upper Bound Bayesian Interval Size 
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Table 22. SAMPLE SIZES AND BAYESIAN INTERVALS USING THE 
TRIANGULAR PRIOR DISTRIBUTION WITH PARAMETER PMAX= 0.2 


Seipple Size Upper Bound FEN ES EIA CINEL Size 
0.1675 0.1640 
0.0040 0.1670 
0.1619 
0.1660 


0.0062 
| SS Ee eee eee a |<) 
en ee 00 ee 0639 TSG 
0.1457 
0 EE aE ee ee a oe 
a Se 0.0277 
0.0881 

0.0803 


0.0389 
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0.0500 0.0868 0.0368 
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APPENDIX G. GRAPHS THAT CAN BE USED TO DETERMINE SAMPLE SIZES 
BY USING PRIOR TRIANGULAR DISTRIBUTION WITH VARIOUS PARAMETERS 
PMIN OR PMAX 
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SAMPLE SIZE n 


Figure 9. Number of Samples vs the Size of the 95% Bayesian Interval with a 


Triangular Prior Distribution with Pmin=0 or Pmax=1 
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Figure 10. Number of Samples vs The Size of the 95% Bayesian Interval with 


a Triangular Prior Distribution with Pmin=0.1 or Pmax=0.9 
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Figure 11. Number of Samples vs The Size of the 95% Bayesian Interval with 


a Triangular Prior Distribution with Pmin=0.2 or Pmax=0.8 


66 


0.5 


q 
< 
eo 
tw © 
2 
a 
< 
it 

Mw) 
< S 
ie 
Se 
lid 
N 
WN 

S 

o 

0 50 100 150 200 
SAMPLE SIZE n 


Figure 12. Number of Samples vs The Size of the 95% Bayesian Interval with 


a Triangular Prior Distribution with Pmin=0.3 or Pmax=0.7 
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Figure 13. Number of Samples vs The Size of the 95% Bayesian Interval with 


a Triangular Prior Distribution with Pmin=0.4 or Pmax=0.6 
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Figure 14. Number of Samples vs The Size of the 95% Bayesian Interval with 


a Triangular Prior Distribution with Pmin=Pmax=0.5 
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Figure 15. Number of Samples vs The Size of the 95% Bayesian Interval with 


a Triangular Prior Distribution with Pmin=0.6 or Pmax=0.4 
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Figure 16. Number of Samples vs The Size of the 95% Bayesian Interval with 


a Triangular Prior Distribution with Pmin=0.7 or Pmax=0.3 
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Figure 17. Number of Samples vs The Size of the 95% Bayesian Interval with 


a Triangular Prior Distribution with Pmin=0.8 or Pmax=0.2 
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